38 research outputs found

    Joint 3D Proposal Generation and Object Detection from View Aggregation

    Full text link
    We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion on high resolution feature maps to generate reliable 3D object proposals for multiple object classes in road scenes. Using these proposals, the second stage detection network performs accurate oriented 3D bounding box regression and category classification to predict the extents, orientation, and classification of objects in 3D space. Our proposed architecture is shown to produce state of the art results on the KITTI 3D object detection benchmark while running in real time with a low memory footprint, making it a suitable candidate for deployment on autonomous vehicles. Code is at: https://github.com/kujason/avodComment: For any inquiries contact aharakeh(at)uwaterloo(dot)c

    Estimating Regression Predictive Distributions with Sample Networks

    Full text link
    Estimating the uncertainty in deep neural network predictions is crucial for many real-world applications. A common approach to model uncertainty is to choose a parametric distribution and fit the data to it using maximum likelihood estimation. The chosen parametric form can be a poor fit to the data-generating distribution, resulting in unreliable uncertainty estimates. In this work, we propose SampleNet, a flexible and scalable architecture for modeling uncertainty that avoids specifying a parametric form on the output distribution. SampleNets do so by defining an empirical distribution using samples that are learned with the Energy Score and regularized with the Sinkhorn Divergence. SampleNets are shown to be able to well-fit a wide range of distributions and to outperform baselines on large-scale real-world regression tasks.Comment: Accepted for publication in AAAI 2023. Example code at: https://samplenet.github.io

    Self-Supervised Image-to-Point Distillation via Semantically Tolerant Contrastive Loss

    Full text link
    An effective framework for learning 3D representations for perception tasks is distilling rich self-supervised image features via contrastive learning. However, image-to point representation learning for autonomous driving datasets faces two main challenges: 1) the abundance of self-similarity, which results in the contrastive losses pushing away semantically similar point and image regions and thus disturbing the local semantic structure of the learned representations, and 2) severe class imbalance as pretraining gets dominated by over-represented classes. We propose to alleviate the self-similarity problem through a novel semantically tolerant image-to-point contrastive loss that takes into consideration the semantic distance between positive and negative image regions to minimize contrasting semantically similar point and image regions. Additionally, we address class imbalance by designing a class-agnostic balanced loss that approximates the degree of class imbalance through an aggregate sample-to-samples semantic similarity measure. We demonstrate that our semantically-tolerant contrastive loss with class balancing improves state-of-the art 2D-to-3D representation learning in all evaluation settings on 3D semantic segmentation. Our method consistently outperforms state-of-the-art 2D-to-3D representation learning frameworks across a wide range of 2D self-supervised pretrained models.Comment: Accepted in CVPR 202

    Determinants of response at 2 months of treatment in a cohort of Pakistani patients with pulmonary tuberculosis

    Get PDF
    Mycobacterium tuberculosis infection continues to be a major global challenge. All patients with pulmonary tuberculosis are treated with a standard 6-month treatment regimen. Historical data suggest that even with shortened treatment, most patients achieve long-term remission. Risk stratification is a goal for reducing potentially toxic prolonged treatment. This study aimed to determine the factors associated with the early clearance of sputum acid-fast bacilli (AFB). A total of 297 freshly diagnosed patients with pulmonary tuberculosis were included and enrolled in this study. Information related to their ethno-demographic and anthropometric characteristics was collected. We also assessed their complete blood counts, and blood iron, folate, and vitamin B12 levels. We found that the presence of higher levels of acid-fast bacilli (AFB) in diagnostic sputum microscopy was the single most significant prognostic factor associated with early clearance of sputum AFB after 2 months of treatment. All of our patients achieved treatment success after 6 months of treatment and were disease free. Our results support the data obtained from previous studies indicating that AFB clearance at 2 months is unlikely to be a clinically useful biomarker or indicator for therapeutic stratification. Furthermore, demographic, anthropometric, and nutritional factors are not clinically useful biomarkers
    corecore